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Abstract. The gamma-ray bursts detected by the BATSE instrument may be separated into "short", "inter- 
mediate" and "long" subgroups. Previous statistical tests found an anisotropic sky-distribution on large angular 
scales for the intermediate subgroup, and probably also for the short subgroup. In this article the description and 
the results of a further statistical test - namely the nearest neighbour analysis - are given. Surprisingly, this test 
gives an anisotropy for the long subgroup on small angular scales. The discussion of this result suggests that this 
anisotropy may be real. 
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1. Introduction 

The separation of the gamma-ray bursts (hereafter GRBs) 

- detected by the BATSE instrum ent (|Meegan et al. 2000D 

- into subgroups is done IjKouveliotou et al. 1 993 
IHorvath 19981 |Mukherjee et al. 1998j ) with respect to T 90 , 
the duration during which 90% of the radiation of a burst 
is measured). Bursts are either short (T90 < 2 s), or in- 
termediate (2 s < T 90 < 10 s), or long (T 90 > 10 s). 
Nowadays it is practically sure that the long and short 
subgroups are different phenomena IjNorris et al. 20011 
IHorvath et al. 20011) . The situation concerning the inter- 
mediate subgroup is unclear; some authors query even the 
reality of this subgroup itself (Hakkila et al. 2000|l . From 
the afterglow data the cosmological origin is directly con- 
firmed for the long bursts only. They are usually at high 
redshifts. For the short and intermediate GRBs there is 
only indirect evidence for a cosmological origin, and con- 
crete redshifts are unknown (for a survey of these ques- 
tions see IMeszaros P. 200 If) . 

During the last years one of the authors to- 
gether with various collaborators provided several 
statistical tests verifying the isotropy in the angu- 
lar distribution of GRBs. These tests were based 
on the binomial distribution (Meszaros A. 1997 
Bal azs et al. 19981 IBalazs et al. 1999 ;. on spherical 
harmonics (Meszar os A. et al. 200 0a). on the counts-in- 
cells method (Meszar os A. et al. 2000 b). on the two-point 
angular correlation function IjMeszaros A. et al. 2000cjl . 
and on multifractal methods l|Vavrek et al. 200T)l . These 
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tests (for a summary see IMeszaros A. et al. 2001|) give 
an anisotropy for the intermediate subgroup. The short 
subgroup also seems to be distributed anisotropically; nev- 
ertheless, there are only a few tests that reject isotropy at 
a high enough confidence level. The long subgroup seems 
to be distributed isotropically; here only the test based 
on the two-point angular correlation function rejects the 
null hypothesis of isotropy (Meszar os A. et al. 2000c|) . 
Recently and fully independently these results were 
confirmed by Litvin et al. (2001). 

In this paper we present the results of a new test; 
namely of the nearest neighbour analysis (hereafter NNA). 
This test (IScott fc Tout 1 989) is a standard statistical 
test, and - as far as known - has not been used yet for 
GRBs. 

The paper is organized as follows. In Sect. 2 the 
method is described. Sect. 3 compares this method with 
other methods. Sect. 4 presents the results of the test. 
Sect. 5 discusses and summarizes the conclusions of the 
paper. 



2. The method 

NNA is a standard statistical test, which compares the 
actual angular distances among the objects on the surface 
of a sphere having unit radius with the theoretical angu- 
lar distances in a randomly and isotropically distributed 
sample. 

The theory of NNA was formulated by Scott & Tout 
(1989). Here this theory is only recapitulated and specified 
for the sky distribution of GRBs. 
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Let there be N objects on the sky, which are dis- 
tributed randomly. N should be > 2; for our purpose we 
may assume N ^> 1. We arbitrarily choose one object 
("first object"). At an angular distance (3 (0 < (3 < tt) 
from the first object we define an infinitesimal belt with 
thickness df3. This belt is defined by the distance interval 
[/3, ([3 + dp)]. The probability of having (L — 1) objects at 
the distance < (3, and one object in the infinitesimal belt 
is given by 

(N- 1)! 

PLmp = 2^(N-L-iy.( L -iy x 

sin/3(l-cos/3) i - 1 (l +cos/3) Ar - i - 1 d/3. (1) 

L can be = 1, 2, (N — 1). For L — 1 the nearest (or: the 
first nearest) object lies in the belt, for L = 2 the second 
nearest object lies in the belt, etc. 

The integral probability L pL(f3')d(3' = Pl((3) defines 
the probability that there are L and exactly L objects in 
the neighbourhood of the first object; the neighbourhood 
is defined by the distances < (3. Pl{k) = 1, as it should 
be. For L = 1 and L = 2 one obtains 

/ X \ N ~ 1 

nm = i - (i - _) , ,2) 

and 



N-2\ (N — 1)(N -2) J w 

respectively, where X = (AT-1) sin 2 (/3/2), and X 2 = (N- 
l)(N-2) sin 2 f . The introd uction of X and X 2 - instead o f 
(3 - simplifies the formulas ijSlechta fc Meszaros A. 1997|l . 

If the integral probabilities for different L are given 
by these analytical formulas, we can use them as the- 
oretical cumulative probability distributions. The stan- 
dard Kolmogorov-Smirnov test can be used to compare 
them with the measured empirical cumulative distribu- 
tions ({Press et al. 19921 Chapt.14.3). 

The test for L = 1 should be done as follows. The the- 
oretical function Pi(X) is defined for < X < (N - 1) 
and is monotonously increasing from to 1 . The measured 
(N— 1) first nearest neighbour distances are sorted into in- 
creasing sequence. Then, for any distance in this sequence, 
one obtains a value X, and at this value the empirical cu- 
mulative distribution function is increased by the value 
(N — 1) . Hence the empirical cumulative function also 
runs from to 1 for the same range of X. The maximal 
absolute value D between the two different cumulative dis- 
tribution functions defines the significance level for given 
(N - 1) l|Press et al. 19921 Chapt.14.3). 

One can do (N— 1) tests, because the test may be done 
for any allowed L. For any L the theoretical cumulative 
function can be calculated and is an analytical function. 



The (N—l) L-th nearest neighbour measured angular dis- 
tances can also be obtained from the positions of objects. 

However, one does not need to provide all possible (A— 
1) tests for one sample. Let us assume that the test is 
made for L = 1, then for L = 2, then for L = (N — 
1). If one assumes that the null hypothesis is true, viz. 
that the objects are distributed randomly on the surface 
of the sphere, then no test should reject this hypothesis. 
Nevertheless, it is unlikely that going to bigger and bigger 
L will give any new result. This may be seen as follows. 

There are N(N — l)/2 distances among the observed 
N objects. But only (2A — 3) distances are independent. 
(N objects on the sphere are defined by 2A" coordinates 
[any object is defined by two spherical angles, e.g. by ■& 
and ip). Nevertheless, one object may be taken - without 
loss of generality - at the pole [i.e. one may have -0 = 
for this "first object"]. A second arbitrary object may still 
be at (p = 0. There are (N — 1) independent distances 
known immediately: they are the $ coordinates of (N — 1) 
objects giving the distances from the first object. There 
are a further (N — 2) independent distances from the sec- 
ond object. They can be calculated if the <p coordinates 
are known. Then any further distances can be calculated 
from these (2A^ — 3) independent distances.) 

This means that, if one uses only L = 1 and L = 2, 
then one uses 2A^ distances in these two tests, which is 
practically identical to the number of independent dis- 
tances. (For N ^> 1 the difference between 2N and 
(2A — 3) is negligible.) Therefore, we will only use the 
L = 1 and L — 2 tests, but not higher L. Once the null 
hypothesis is rejected by either the L — 1 or L = 2 test 
at a given significance level, then this rejection is correct. 
Only the significance level obtained provides a lower limit 
for this rejection, because it is still possible that some fur- 
ther tests with higher L will reject the null hypothesis at 
a higher significance level. 

There are two problems with the application of this 
test. One problem is general and the second is a special 
problem occurring for the BATSE data of GRBs. 

The first problem is the following. To compare the 
theoretical curve with the empirical curve one needs A^ 
measured independent L-th nearest distances. We have 
only one single sample: the actual distribution of N ob- 
jects on the sky. In it one has N L-th nearest neighbour 
distances; for any object one calculates the L-th nearest 
neighbour. But these A^ distances need not be indepen- 
dent, because some distances may occur twice. (If the L- 
th nearest neighbour for k-th object is the m-th object, 
then it may well happen that the L-th nearest neighbour 
for m-th object is the k-th object.) Fortunately, this is not 
an essential defect excluding its use i|Scott & Tout 1989| . 

The second problem concerns the case of GRBs alone. 
It follows from the fact that the sky is not covered 
uniformly by the BATSE instrument. There is a sky- 
exposure function g(S) that depends on the declina- 
tion S ( |Meegan et al. 2000| ). Therefore, the theoretical 
curve from Eq. (2). and Eq. (3). is not usable at once. 
Fortunately it is easy to take this into account. (In the 
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previous papers cited in Sect. 1 the effect of a non-uniform 
sky-exposure function could also be corrected for. Hence, if 
we discuss anisotropy, we always mean intrinsic anisotropy 
in the distribution of GRB not caused by the BATSE in- 
strument.) 

This correction may be made as follows. Let there be 
a burst at declination 5. Then one can always introduce a 
new declination S unambiguously by the relation 

g(6') cos 6' dS' = A cos 8' d5\ (4) 

-7T/2 J-n/2 

where 

,7T/2 

A= / g(6') cos 6' dS', (5) 

J -it/2 

This means that - formally - the declination is "shifted" 
to a new value. If there is an intrinsic isotropy in the 
distribution of GRBs, the GRBs should also be dis- 
tributed isotropically in the new "shifted" coordinates. 
Note that this shift is a standard method in Statistics 
flTrumpler fc Weaver 19531 Chapt.1.13). Note also that 
this elimination of the non-uniform sky-exposure function 
is new. 

Because one should obtain isotropy in these "shifted" 
coordinates - if there is intrinsic isotropy - any statisti- 
cal test that assumes uniform sky exposure can be used. 
Therefore NNA is also usable. We will provide the NNA 
test for L = 1 and L = 2 in these "shifted" coordinates 
for the short, intermediate and long GRBs, respectively. 

3. Comparison of the method with other tests 

In this section the advantages and disadvantages of NNA 
are summarized and compared with other tests mentioned 
in Sect. 1. 

First of all, we want to remark that NNA is highly sim- 
ilar to the two-point angular correlation function method. 
In both cases the key idea is the same: There are N(N — 
l)/2 angular distances among N objects, and these mea- 
sured angular distances are compared with the theoreti- 
cally expected distances following from the random angu- 
lar distribution of N objects. 

Both methods have the great advantage that they are 
independent of the choice of coordinate systems and also 
of other artificial choices. (For example, in the counts-in- 
cells method the boundaries of the cells must be chosen 
ad hoc (Meszaros A. et al. 2000b1) . No such ad hoc choice 
is needed here.) A further advantage of these methods 
lies in the fact that they are also able to detect even- 
tual anisotropies on small angular scales. (If there are N 
(N ^S> 1) objects on the sky, then the angular scales - 
measured in radians - are large, if these scales are much 
bigger than ~ y^in/N ; small angular scales in radians cor- 
respond with distances ~ \J An/N ; for further details see 
Scout & Tout (1989).) This is a great advantage of NNA 
compared, e.g., with the counts- in-cells method, which is 
insensitive to angular scales much smaller than the cell size 



(inside a cell eventual anisotropies may be "smoothed" ; for 
more details see Meszaros et al. (2000b)). In principle, the 
method based on spherical harmonics, and multifractal 
methods are sensitive to small scales, too. Nevertheless, on 
scales ~ v/ 'Att/N radians these methods begin to have sev- 
eral technical problems (see, e.g., Meszaros et al. (2000a) 
for further details). All this means that NNA and the 
method based on the two-point angular correlation func- 
tion may well detect anisotropies on scales ~ y/ Att/N ra- 
dians, which were not detected yet by other methods. 

There are two essential differences between NNA 
and the method based on the two-point angular corre- 
lation function. The first concerns the number of dis- 
tances used: NNA uses only 2N angular distances, but 
the second method uses N(N — l)/2 distances. The sec- 
ond concerns the procedure of the calculation of signif- 
icance levels: The method based on the two-point an- 
gular correlation function needs Monte-Carlo simulations 
HMeszaros A. et al. 2000c)l : NNA does not need them. It 
is well known that one has to be careful using pseudo- 
random generators and Monte-Carlo simulations (see, e.g., 
Chapter 7.0 of Press et al. (1992)). Therefore it is essen- 
tial to use a method that does not need these pseudo- 
random simulations. On the other hand, this advantage 
of NNA is lessened by the fact that it uses only 2N dis- 
tances. This means that NNA may miss anisotropies de- 
tected by the correlation function method. On the other 
hand, any anisotropy detected by NNA method should 
also be detected by the correlation function method. In 
other words, NNA is not as powerful as the correlation 
function method. Its advantage is given mainly by its 
simplicity and by the fact that it does not need pseudo- 
random generations. In any case, its use in the case of 
GRBs is certainly justified. 

Add to this that, as it is well-known in Statistics 
dTrumpler fc Weaver 1953| ), it is quite usual for different 
tests to give different conclusions. (Different tests give dif- 
ferent "trials".) Some tests may reject the hypothesis of 
isotropy while further tests do not reject it. In addition, 
if two different tests reject it, then this rejection may oc- 
cur at different significance levels, too. Trivially, if there 
is an isotropic distribution, then no test should reject the 
hypothesis of this isotropy. This means that, at least in 
principle, one single test rejecting the isotropy is enough to 
proclaim the existence of anisotropy. On the other hand, 
using several tests is clearly better, because if only one 
single test rejects the isotropy one can never exclude with 
certainty that there may not have been some unknown 
technical problems (pseudo-random simulations, unknown 
instrumental effects, unknown systematic errors in mea- 
surements, etc.). 

In fact this is the situation also for long GRBs (see 
Sect. 1): Isotropy of both the short and intermediate sub- 
groups of GRBs is rejected by several tests, and hence 
it may be stated that these subgroups are distributed 
anisotropically on the sky. On the other hand, the isotropic 
distribution of the long-GRB subgroup is rejected by one 
single test only; hence its anisotropy is questionable still. 
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X 2 

Fig. 1. Kolmogorov-Smirnov test for the second nearest 
neighbour for the long subgroup containing N = 1239 
GRBs. The theoretical curve is the "smooth" curve; the 
empirical curve is a step function increasing in steps of 
1/1239. The biggest difference D between the two curves 
is at around X2 — 1500, where the theoretical curve is at 
the bottom. 



4. The results 

There are 2702 GRBs in the BATSE Catalog 
flMeegan et al. 2000| > , and from them 2037 GRBs 
have measured Tgo. They are separated into three sub- 
groups: 497 GRBs having Tgo < 2 s comprise the " short" 
subgroup; 301 GRBs having 2 s < Tgo < 10 s comprise 
the "intermediate" subgroup; and 1239 GRBs having 
Tgo > 10 s comprise the "long" subgroup. Because the 
existence of the third intermediate subgroup is not sure 
yet (see Sect. 1), for safety we also test the "non-short" 
subgroup contaning 1239 + 301 = 1540 GRBs having 
Tgo > 2 s (i.e. the "intermediate" and "long" subgroups 
are taken together) . For the sake of completeness we also 
tested the sample of "all" GRBs, containing 2702 GRBs. 
For any sample we provide the nearest neighbour and 
second nearest neighbour analysis using the standard 
Kolmogorov-Smirnov test. For any sample we take the 
bigger D from the L = 1 and L = 2 tests. 

Table 1 shows the results. 

Table 1 gives the remarkable result: for the long sub- 
class, and only for this subclass, is the null-hypothesis of 
isotropy rejected at the usual > 95% significance level (viz. 
99%). This result follows from the L = 2 test. 

Both the theoretical and empirical cumulative distri- 
bution functions for L = 2 are shown in Fig. 1. From this 
it follows that around X2 — 1500, i.e. around j3 ~ 0.6 radi- 
ans ~ (3 — 4) degrees, the empirical curve lies significantly 
above the theoretical one. This means that on this angu- 
lar scale the actual angular distribution of GRBs shows 
an overdensity compared with the random isotropic case. 



Table 1. The results of the Kolmogorov-Smirnov tests 
for the three subclasses. The first column denotes the sub- 
class; the second the number of GRBs of this subclass; the 
third column gives the difference D between the theoreti- 
cal and empirical cumulative function; the fourth column 
gives the significance (in percentage) of the rejection of 
the null-hypothesis of isotropy (Eq. 14.3.9 of Press et al. 
(1992)). 



subgroup 


N 


D 


% 


short 


497 


0.049 


82.1 


intermediate 


301 


0.054 


76.5 


long 


1239 


0.046 


99.0 


non — short 


1540 


0.021 


51.2 


all 


2702 


0.021 


77.2 



5. Discussion and conclusion 

If one compares the previous tests surveyed in Sect. 1 with 
Table 1, it seems that the occurrence of anisotropy for the 
long subgroup, and for this subgroup only, is a surprising 
result. 

Nevertheless, nothing unexpected is occurring here. 
First, NNA is sensitive on small angular scales (for the 
long subgroup on ~ yj An/N = 0.2 radians). Previous 
tests were sensitive on large angular scales, so it is not 
strange to detect anisotropy on angular scales of a few de- 
grees. Second, the two-point angular correlation function 
shows anisotropy for this subgroup, too (see the survey in 
Sect. 1). (As is noted in Sect. 3, it is to be expected that 
anisotropy detected by NNA should be detected by the 
method based on the correlation function, too; the oppo- 
site does not hold). Third, from Eq. 14.3.9 of Press et al. 
(1992) it follows that the significance level increases with 
Dy/N. From Table 1 one sees that D is more or less the 
same for the first three subclasses; hence, the long sub- 
class is anisotropic due to high N. It is therefore quite 
possible that the anisotropies for short and intermediate 
subclasses respectively are not detected by NNA due to 
the small N. Fourth, visual inspection of the distribution 
of long GRBs on the sky (Fig. 2) also shows some group- 
ing on small angular scales. Fifth, in fact some anisotropy 
on degree scales is expected from Cosmology: at z <C 1 
[z is the redshift) the distribution of galaxies and other 
objects is inhomogeneous on scales ~ (10 — 100) Mpc 
(see [Meszaros A. 19971 and references therein). At high z, 
where the long GRBs dominantly are, these scales corre- 
spond to angular scales of some degrees. (Direct measure- 
ments from GRB afterglows give z = 4.5 for the maximal 
redshift (Meszaros P. 2001); indirect observational data 
also allow z ~ 20 i|Meszaros A. fc Meszaros P. 1 9961.) 
Hence, if the present-day inhomogeneities exist also at 
the high redshifts and if the distribution of long GRBs 
reflects the distribution of matter at these high redshifts, 
then anisotropy of long GRBs on degree scales is quite 
possible. 
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All this means that the occurrence of anisotropy for 
the long subgroup and the simultaneous non-occurrence of 
anisotropy for the intermediate and short subgroup is not 
strange. On the other hand, the isotropy of the non-short 
sample is remarkable. It seems that GRBs from intermedi- 
ate and long subgroups separately are anisotropically dis- 
tributed; but differently and that their anisotropies "can- 
cel" . It is urgent to clarify the status of the intermediate 
subgroup - namely whether is it a real different subgroup 
or not (see Sect. 1). Our result, together with Horvath's 
recent results (Horvath 2002), suggest that the intermedi- 
ate subclass may be a real subgroup. 

The result concerning the sample denoted by "all" has 
no real meaning, because the short and long subclasses 
may be different phenomena (see Sect. 1). 

As was already noted, theoretically (see Sect. 3), it is 
sufficient to have one single rejection of the null hypoth- 
esis from one single statistical test. In practice, the result 
is more reliable if several tests reject the null hypthesis. 
Concerning the long subgroup this is what we get here, 
because both NNA and the correlation function method 
reject the isotropy. Hence this paper may lead to the con- 
clusion that the long bursts are also distributed anisotrop- 
ically. 

Nevertheless, the positive result of this paper, together 
with a similar result from the angular correlation function 
is encouraging, but not yet definite; we still have to be 
very cautious, for a number of reasons: 

— These two methods use the same idea: the measured 
angular distances are compared with what would be 
theoretically expected for an isotropic sample. Hence, 
these two methods cannot be considered as fully inde- 
pendent statistical tests. 

— As was noted, the use of NNA has a general prob- 
lem, because some distances may occur twice. Hence, 
in principle, the results of this test alone need not be 
final. 

— The rejection of isotropy on small angular scales may 
by caused - at least in principle - by a wholly differ- 
ent phenomenon; even if the long bursts are distributed 
isotropically, but some bursts are occurring at the same 
position several times (i.e. if some bursts are repeat- 
ing), then the NNA test may give a positive result. This 
mixing of two effects is discussed, e.g., by Brainerd 
(1996). Nowadays it is practically certain that for the 
long bursts no repetition occurs. This follows mainly 
from the models of these GRBs - they always assume a 
total destruction of source (see, e.g, Meszaros P. (2001) 
for a survey of models). On the other hand, artificial in- 
strumental effects cannot be excluded yet for the long 
bursts. It is in principle possible that a long burst is de- 
tected again by the BATSE instrument after one or two 
orbits of the satellite. Then the new detection would 
be included in the Catalog as a new burst, but one 
would have in fact one single burst (V. Connaughton, 
private communication). Hence, strictly from a statis- 
tical point of view, instrumental repetition is not ex- 



cluded yet definitely. In order to discuss this possibility 
we have searched for pairs of GRBs in the sample de- 
noted "all", in which: A. the later GRB occurred 4 
hours after the first event, B. the angular distance be- 
tween them is less than 10 degree, C. at least one GRB 
from the pair should have Tgo > 10 s. Only three pairs 
were found that fulfilled all these requirements. Their 
BATSE trigger numbers are: 5648-5649, 6165-6166 and 
7359-7360. In addition, from these six GRBs only three 
(5648, 6165, 7360) belong to the long subclass; hence, 
there is no pair in which both GRBs belong to the long 
subgroup. It is highly questionable that in these three 
cases instrumental repetition occurs. But, even assum- 
ing this, if one deletes three GRBs from the "long" 
sample containing 1239 GRBs, it does not make an es- 
sential change in the sample: the effect of instrumental 
repetition can hardly have any importance for our con- 
clusions. 

— The non-short sample does not show anisotropy. This 
means that, if the existence of a third intermedi- 
ate subgroup were not confirmed in the future (this 
is still possible even after Horvath's recent paper 
IjHorv ath 2002))) , then the anisotropy obtained in this 
paper would be an interesting and remarkable result 
but not a proof of the anisotropy of a separate physi- 
cally well-defined subclass of GRBs. 

— The positional errors of GRBs are large (a few de- 
grees), and are comparable with the angular scale of 
the expected anisotropy. Nevertheless, an earlier study 
( |Tegmark et al. 199 6 ) shows that these positional er- 
rors cancel on average, and hence their effect need not 
be important. 

Keeping all this in mind, we conclude that the 
anisotropy of the long subclass at a > 99% significance 
level may be real. However, to reach a more definite con- 
clusion several further independent tests are still needed. 

We are aiming to provide such tests on BATSE data 
in future. We also hope that the results of this paper - 
together with those of the earlier papers - will also en- 
courage others to provide independent statistical tests on 
the angular distribution of GRBs. 
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